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SUMMARY 

We use analytical examples and asymptotic forms to examine the mathematical structure and 
physical meaning of the seismic cross correlation measurement. We show that in general, cross 
correlations are not Green's functions of medium, and may be very different depending on the 
source distribution. The modeling of noise sources using spatial distributions as opposed to 
discrete collections of sources is emphasized. When stations are illuminated by spatially com- 
plex source distributions, cross correlations show arrivals at a variety of time lags, from zero to 
the maximum surface-wave arrival time. Here, we demonstrate the possibility of inverting for 
the source distribution using the energy of the full cross-correlation waveform. The interplay 
between the source distribution and wave attenuation in determining the functional dependence 
of cross correlation energies on station-pair distance is quantified. Without question, energies 
contain information about wave attenuation. However, the accurate interpretation of such mea- 
surements is tightly connected to the knowledge of the source distribution. 

Key words: Theoretical Seismology - Wave scattering and diffraction - Wave propagation. 



1 INTRODUCTION 

Terrestrial seismic noise is generated at a range of temporal f requencies, by human activity, storms, oceanic wave microseisms (e.g. , 
Longuet-Higgin s. 1950; K edar & Webbiboodlstehlv et 



mporal frequencies, by human activity, storms, oceanic wave microseisms (e.g. , 
all l2006h and the ocean-excited low-frequency hum of Earth (e.g., Nawa et al., 1998; 



Rhie & RomanowiczL 12004) . Seismic noise is as used as a compe lling alternative to earthquake tomography to image the crust. Most impor- 



tantly , it enables the stud y of temporal variati ons of the crust (e.g., IWegl er & Sens-Schonfelder, 2007; Brenguier et al., 2008; Zaccarelli et al 



2011 



Rivet et al 1120111) and volcanoes (e.g. jBrenguier et all 120071) . The cross correlation measurement has a physical flavor that is intrin- 



sically different from the classical tomographic analog, i.e., wavefield displacement. In particular, the time variable in classical tomography 
is the propagation delay between the source and the station whereas the time lag in cross correlation tomography is connected to the path 
difference between the source and the two stations. 

2010h 



Under controlled circumstances, such as a when the source distribution is uniform, representation theorems fe.g. jFleurv et al 



allow for the cross correlation to be written as a modulation of Green's function between the stations. In other words, such theorems state 
that an equally weighted sum over Green's functions between every source (over all space; constant amplitude) and the stations is equivalent 
to Green's function between the stations. However, Earth noise is typically anisotropic and in such a scenario, Green's functions along some 
source- station paths are weighted more strongly than others and the elegant correspondence may be lost. Further, the climate is in continuous 
flux, and the manner of excitation of seismic noise by, e.g., ocean waves, changes through the year (e.g., Stehlv et all 2006h . Seismology is 
a precision science and consequently, modeling the source distribution and its effect on the cross correlation is critical. 

The study of terrestrial seismic noise has strong connections with the seismic wavefields of stars, and in particu lar, the Sun. The use 
of cross correlations of the wavefield of the Sun to probe interior solar structure was pioneered by Duvall et all ( 1993 ) in a landmark paper. 
The formal interpretation of these measurements had to wai t till the advance by Woodar dl (1997), who laid the theory of cross correlation 
on a formal statistical foundation. A number of years later, bizon & Birchl J2002n . based on this work, were ab le to compute kernels for 
cross correlations of helioseismic noise arising from a distribution of sources. However, Gizon & Birchl d2002[) did not account for 3-D 
heterogeneous backgr ounds and the rewriting of their formalism in the language of adjoint methods was led bv lTromp et al. ( 2O10h for the 
terrestrial case and by lHanasoge et al.1 (1201 lh for the helioseismic scenario. One useful concept t hat emerge d from these articles is that of 
dealing with source distributions as opposed to a discrete number of them (e.g., Larose et al. . 20061 : TsaiL 2009h . More importantly, the results 
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of Tromp et all i2O10l) enabled the computational prediction (in a forward sense) of cross correlations based on a given Earth model and 
source distribution (this problem has received considerable attention: e.g.,[P edersen et al., 2007; IChevrot et all 120071 : 1 Yang & Ritzwollei . 



source distribu tion (this problem ha s received considerable attentio n: e.g., Federsen et al., 
20081 : IWeaver et all 12009 : ICupillard & CapdevillelJ201ol : lTsai[l201ol : lFroment et allboich . 



In this article, we discuss some of the concepts underlying the cross correlation measurement using a simple 2-D example. Section [3 
deals with the cross correlation and its connection to the source distribution. Storms, which excite seismic waves, but sometimes physically 
move substantial distances over a span of days (i.e., over the measurement window), can be modeled as well, albeit through a more complex 
ansatz for the distribution. We also introduce the basic partial differential equation governing the wavefield and Green's function for the 
simple case studied here. The analysis of the variations of the cross correlation due to the changes in the source distribution, thereby leading 
to the sensitivity kernel are discussed in Section [3] In particular, its asymptotic form reveals the structure of the source- amplitude kernel. 
The cross-correlation energy misfit and its kernel are discussed and computed semi-analytically in Section ITT1 Based on this formalism, the 
impact of non-uniform source distributions on cross correlations is examined and we make a case for imaging of the source distribution and 
briefly discuss the limitations in Section [3T2l 

The operator formulation of the adjoint method discussed by, e.g., Ficht ner et al. I i2006h . which works elegantly for classical tomography, 
unfortunately does not naturally apply to higher order measurements. In classical tomography, we vary the wavefield, which is directly the 
solution to the wave operator. To create an equivalent operator formalism for cross-correlation tomography, one needs to write a differential 
equation for the cross c orrelation itself, which is impractical. Cons equently, we must carry out the Born expansion by brute force and analyze 
the resultant terms (e.g., Tromp et al. , 2010l : Hanasoge et al. . 2011 ). as discussed in Section|4]of this article. Three Green's functions appear in 
this expansion (as opposed to two in the classical tomography case) and their role in modeling scattering is elucidated. The source distribution 
plays a critical role in determining cross-correlation energies. In order to accurately interpret cross-correlation energies in the context of 
wave attenuation or scattering, the source distribution must be well known, as discussed in Section [5] Indeed, once the effect of sources has 
been accounted for, cross-correlation energies contain information about attenuation. In a dense network, the sensitivity to attenuation and 
scattering is primarily restricted to the region within the network because the hyperbolic features that appear in cross-correlation kernels for 
attenuation (e.g., see Figure 6 of iTromp etalll2010h cancel. We conclude in Section[6] 



2 FORMAL INTERPRETATION 

Cross correlations of seismic noise fluctuations 0(x, t), denoted by C a/ g(t), are defined as 



C a p(t;T)= [ dtXxa^'J^x^t'+t), (1) 
Jo 

where T is the temporal length of averaging, t is time and x Q , are spatial locations at which measurements are made. In order to obtain 
cross correlations of a reasonable signal-to-noise ratio, T must be on the order of several source correlation times and wave travel times 
between source- station pairs. As the temporal window of averaging T grows, the cross correlation approaches a limiting value (provided 
the source distribution and the medium do not change subs t antiall y over this time scale), i.e., what we term the expectation value. This 
was labelled the ensemble cross correlation bylTromp et all (Eoio|) in order to describe ensemble averaging over many source times (and 
realizations; see also, e.g. jLarose et alll2008l : ICupillard & Capdevillell20ld for convergence studies). Moments of stochastic processes for 
which expectation values exist are termed ergodic. Terrestrial seismic noise is ergodic because wave excitation of oceanic origin appears to 
have well behaved statistics (e.g., when the sources are described by a Gaussian random process). 

The relation dA.51 > when applied to equation ([TJ allow us to describe the cross correlation in temporal Fourier domain 

C a p(u) = 0*(x a ,cj) <p(x.p,u). (2) 

Denoting the limit (or expected) cross correlation by (C a p(uj)), we have 

(C a p) = (0*(x a ,cj)0(x /3 ,cj)). (3) 

The wave equation we consider here is 

pdU- V-(c 2 V0) = S(x,t), (4) 

where p is density, x = (x, y) is a 2-D flat space, t time, (f> the wave displacement, V = (d x , d y ) the covariant spatial derivative, 5(x, t) 
the source and c wavespeed. For the simple case considered here, we assume constant wavespeed c. Green's function G(x, x'; t) for the 
displacement at (x, t) due to a spatio-temporal delta source at (x', 0) is the solution to 

(pdt - c V 2 )G(x, x; t) = <5(x - x')6(t). (5) 

This equation is explicitly solvable; G reen's function in temporal Fourier- transform (according to the convention defined in appendix [A]) is 
given by (e.g.. I Aid & Richardsl[l98ol ) 



G(x,x', W ) = .ff< 1) (^|x-x'|) ) (6) 
where uj is temporal frequency and is the Hankel function of the first kind. This is also approximately the surface wave portion of 
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Figure 1. Cross correlations from a general model of sources as stated in equation (|9j. The source at x excites waves that propagate to a and the source at 
x' generates waves that propagate to /3. Because the sources themselves are statistically correlated, the expectation value of the wavefield cross correlation 
measured at points x a ,x^ is non-trivial. Note that if the sources at two spatial points were statistically independent, the expectation value of the cross 
correlation would be zero when x ^ x ; , as shown in Figure[21 This form of source distribution is useful in modeling, e.g., storms, which can move substantial 
distances over relatively short times. 



Green's function for a laterally homogeneous Earth. To lighten notational burden, we cease to explicitly state frequency u unless required. 
The wavefield 0(x) excited by sources S(x.') is described by 

0(x) = y(ix / G(x,x / )5'(x / ). (7) 
The correlation in Fourier domain (0 may be rewritten in terms of Green's functions and sources 

C aP (tj) = J dx' J dxG*(x a ,x)G'(x^x / )^*(x)5(x / ), (8) 
and the expected cross correlation becomes 

(C«/j> = J dx' J dxG*(x c „x)G'(x^x / )(^*(x)5(x / )), (9) 

where the ensemble averaging has been brought into the integral and placed around the source terms. This is the point at which we have 
moved from treating dynamically evolving sources to studying their statistics. Thus we have taken a system whose source distribution is 
unknown and posed it in terms of a (potentially) computable statistical theory. Equation (HJl states that a wave excited at x propagates, 
through a medium described by Green's function, to point x a and similarly form x' to x^, pictorially depicted in Figure Q] Contributions 
from wave sources over all space are summed to produce the wavefield at points x a , x^, which explains the spatial integrals. For a complete 
theory, we need to include the statistical spatial covariance of the source distribution, i.e., A(x, x', uj) = (5*(x, uj) S(x.' , cj)), but such a 
problem is very hard to study. Consequently, we model spatially uncorrelated sources, i.e., A(x, x', uj) = V(oj) <j(x) <5(x — x'), where V 
is the power spectrum and <j(x) is the source distribution in space. This choice greatly reduces the number of degrees of freedom in any 
eventual inverse problem (see Figure [2]). Note that the source distribution typically varies as a function of frequency, and this can be modeled 
by studying narrowly filtered cross correlations such that we may invert for a different spatial distribution in each frequency window. Different 
parametrizations of A may be chosen depending on the problem at hand. Using this assumption, we have 



(C a p) = J (ixf (x tt , x) G(x/3, x) V(uj) cr(x), 



(10) 



which allows us to construct forward models of cross correlation, a first step towards inversions. Substituting Green's function j6]l into i\0l . 
we obtain 



(C Q/3 ) = J dx//< 2) (^|Xa - x|) (^| X/3 - x|) Pa(x) 



(11) 



where the Hankel function of the second kind is defined thus — Hq 1 ^* . For convenience, we also define A a = |x Q 



4 Shravan M. Hanasoge 




Figure 2. Cross correlations due to the distribution with (S* (x, uj) S(-x.' ,uj)) = V(oo) cr(x) £(x — x'), as described in equation fTol . This model renders 
feasible the prediction or forward computation of the cross correlations because of the reduction in the number of integration variables. The inverse problem 
is also more easily dealt with since the number of degrees of freedom is much smaller. 



3 STRUCTURE OF SOURCE KERNELS 

Sensitivity kernels for noise distributions were introduced by Tromp et al. i2O10K who termed them ensemble kernels. Consider the inverse 
problem where we are interested solely in the source distribution, i.e., variations of the correlation are rooted only in variations of the source 
distribution (as opposed to a more complete inverse problem which would contain variations to structure as well) 



Suppose the measurable 5F, such as a travel time or energy, is locally a linear functional of the variation of the cross correlation, i.e., 
where W a p(uj) is some weight function. Then we have 

where it may be seen that the term within the square brackets is the source kernel i^ a/ g(x) 

which is the sensitivity to variations in the amplitude of the source distribution. With a little help from asymptotics, we can immediately 
perceive the structure of this kernel. The far-field approximation to Hankel functions, i.e., for large values of the argument, is 

2 



(12) 



(13) 



(14) 



(15) 



exp 



(16) 



Thus, were there to be no source activity at very low temporal frequency, at a distance of several wavelengths away from the measurement 
locations x Q) ^, we may write 



(1) (u 



—A/ 



2c 



(2) (U 



c 



TVUjAf 



2c 

7TUjA a 



exp A£ 



. 7T 

l 4 



exp 



— A a +i- 

c 4 



Substituting these asymptotic relations into the expression for the kernel in equation il5l * we obtain the far-field limit 



if a/3 (x) 



— -f 



duo f{uo) exp 



(17) 
(18) 

(19) 
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Figure 3. Contours of constant path difference are hyperbolae whose foci are the two stations x a , x^. For a given time-lag measurement of the cross correlation 
(i.e., choice of measurement time window), sources along the hyperbola constructively contribute to it. 



where f(u) 



4 c W*p V /u). Comparing this to the definition of the inverse Fourier transform dA.2K we conclude that 

' A a - A/A 1 



(20) 



VAcA? 

and the source-distribution kernel is also a function of the path difference between a given point and the two stations, shown in Figure[3] This 
relationship suggests the exi stence of hyperbolic features in the kernels, i.e., contours along which (A Q -A^)/c= ( I x — x <*| — | x — x.p\)/c = 



C is constant (also see, e.g. JSniederu2004l ; lRoux et all 120051 ; ICupillard et alluOllh . The presence of the term 1 / y/ A a Ap results in a much 
greater sensitivity to regions close to the station and the path joining the stations. In comparison, the source kernel possesses relatively 
weak sensitivity to areas away from this line. Since we have assumed that the source distribution is spatially uncorrelated, contributions that 
constructively add to the expected value of the cross correlation can only be from the same point in the source distribution. In other words, 
a source excites a waves at x, which propagate to x a)J g as described by Green's functions between the two stations and the source location. 
The cross correlation thus registers the travel-time difference between a source point and the two stations, which remains constant along 
hyperbolae whose foci are the two stations. Source kernels are relatively easy to compute since there are no scattering terms. When including 
scattering, variations of Green's function come in to play, and a Born expansion is required to determine the variation of Green's functions. 

An interesting analogy to note is that between scattering in the classical tomographic case (banana-doughnut kernels) and the source- 
amplitude kernel in the cross-correlation scenario. We will not go into mathematical detail but in both of these cases, two Green's functions 
participate in the construction of the kernels. The sole difference between these two scenarios is that the source- amplitude kernel consists 
of a correlation betwee n two Green's functions and the scattering kernel is composed of a convolution between Green's functions (e.g., 
Marquering et al. , 1999h . The source- amplitude kernel in the cross correlation case is thus analogous to anti-causal scattering in the classical 
case since one of the two Green's functions has a complex conjugate in the cross-correlation case. The correlati on/convolution difference 
changes the character of the kernel: elliptical features are observed in classical banana-doug hnut kernels (e . g., Marquering et all 1 1999h 
whereas hyperbolic features are seen in the cross-correlation source- amplitude kernels (also see, Gizon & BirchL 2002h . Equation dTSb for the 
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classical scattering analog would then be 

*£f (x) = J ctoW^H^ (^A a ) (^A P ) , (21) 

where now this represents a convolution (note that both are Hankel functions of the first kind) and where K^p (x) is a classical scattering 
kernel. Applying the same asymptotic analysis, we obtain 

which will produce elliptical features since these are contours of constant path length, i.e., where (A Q + Ap)/c = ( |x — x a | + |x — |)/c = 
C. 



3.1 Computing Kernels 

In this section, we discuss the structure o f source- amplitude ker nels through the exact computation of equations (UTT) and dT5l . The formulation 



used in this section follows from that of lDahlen & Baigl (|2002). We begin the process of computing kernels by defining a misfit functional x 
in terms of the measured energy anomaly, 



A sy * 



In -r^r I , (23) 



where the energy is defined as 



_ i jdtwjt) (c a pdw _ 1 1 r Hij(r# ur ; 



(24) 



where is the windowing function, C a p is the windowed cross correlation and T — f dtw(t). Note that we use the term energy 
interchangeably with amplitude. To preserve simplicity, we do not apply frequency filters, although they may be easily included. In general, 
although we may compute sensitivity kernels for other measurements such as travel times, we restrict ourselves here to the cross correlation 
energy. With a little manipulation, not shown here, the variation in misfit is given by 

Sx = ~ 5 ('" S) W = " 5 Ge) ' ( ln S) / *- «•> <iC "»»' 

If one were to, as before, assume that the variations in the cross correlation only arose from changes to the source distribution, 

(SC a p} = J dxG*(A a )G(Ap)V(uj)Sa(x). (26) 



Thus defining the weight function as 



variations to the misfit functional are then described by 



S ("§)/' 



S X = -2^\fo -j^e J / d*K a p(*) fo(x). (28) 
The kernel normalization is tested by confirming that the following integral is satisfied 

J dxK aP (x)a(x)= (^y^j ^ J <ko{C* a/3 } J dxG*(A a )G(A^)PHa(x) = l (29) 

(obtained upon applying definition J24t for the energy and equation dTTTb for the expected cross correlation). 

We compute source- amplitude kernels (around a uniform distribution a = 1) in the temporal Fourier domain, using the exact functional 
form of Green's function (|6]l. The wavespeed is set to c — 1 km/s. The expected (limit) cross correlation contains symmetric positive and 
negative branches. The power spectrum and its temporal representation are shown in Figure |H We use a temporal grid of 401 points and the 
frequency spacing of 0.05 Hz, and hence a time window of 20 seconds, as in Figure HI The spacing in the temporal grid is 0.05 seconds, 
implying a Nyquist frequency of 10 Hz. In order to compute the integral over frequency in equation (TTSb , we precompute Hankel functions 
on a grid of 681 x 681 points resolving a square of size [—40, 40] x [—40, 40] km 2 . These function values are then utilized to compute the 
expected cross correlation and kernels. For the examples in Figure[5] we choose <j(x) = 1, i.e., a uniform distribution. In Figured we show 
examples of how the choice of the measurement affects the sensitivity kernel. On the upper panels, the expected cross correlation for a point 
pair separated by a distance of 10.6 km. The dashed box indicates the choice of measurement, a 4 second window on the left panel and a 
0.5 second window on the right. Kernels corresponding to these choices are shown immediately below. A thicker hyperbola, indicative of a 
broader range of path differences, is seen on the kernel to the left (compared with the thinner hyperbolae on the right). Because we choose 
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Figure 4. Source-time function (upper panel) and the power spectrum V{uj). 



only the positive branch, the kernel shows sensitivity only to waves that first arrive at the station on the left and subsequently to the one on 
the right. Consequently, the hyperbolae point to the left. 

In general, the source- amplitude kernel depends indirectly on the choice of the initial spatial distribution of sources. Firstly, the cross 
correlation is obtained by evaluating an integral involving the spatial source distribution over space (Eq. lUTI ). This predicted cross correlation 
is then used in the computation of the kernel (Eqs. llT"5l and (27) ). The function W a p, from equation (E7K assigns a frequency-dependent 
weight to the two Green's functions in the integral ( fT5l ). But since Green's functions are an inseparable mix of frequency and space (see 
Eq. (6)), the kernel resulting from the evaluation of JT5l > will possess a spatial dependence that reflects the source distribution. 

Source- amplitude kernels as a function of interstation distance are graphed in Figure [6] It is seen that at small distances, the lateral size 
of the kernel is comparable to the interstation distance whereas at very large distances, the sensitivity is restricted to a small range of azimuths 
around the interstation path. There are disadvantages to using measurements at stations separated by small distances since they are only able 
to image the source distribution in their vicinity, leading possibly to errors in the inversion. Further, if the distance between the stations is 
less than a wavelength, the cross correlation does not provide very much additional information and should be removed from the analysis. 

In Figure [7] we show kernels as a function of interstation distance and the temporal frequency of the measurement. Higher- frequency 
waves lead to kernels with greater complexity and spatially sharper features. Much as in classical tomography, finer-scale images of the 
source distribution may be obtained by using higher- frequency measurements. It is unclear how useful this will be when attempting to invert 
for oceanic microseisms since they occur at narrow frequency bands. In other words, if one were to incorporate higher or lower frequency 
measurements, away from the central microseism excitation frequency, would be of very limited utility in imaging. A more thorough study 
is needed in this regard. 
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One way source kernel, station pair distance = 5.3 tyri|o" 3 
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One way source kernel, station pair distance = 5.3 kjri|Q 4 



1 1] 



Figure 5. Expected cross correlation (upper panels) and the attendant kernels (bottom panels). Stations are marked by the diamond symbols. A wider measure- 
ment window implies that the the hyperbolae are thicker, as can be seen upon comparing lower left and middle panels. Note that when the window straddles 
the zero time lag part of the cross correlation, the kernel will show sensitivity to sources along the bisecting line perpendicular to the path between the stations 
(bottom right). The part of the kernel along the line joining the two stations (i.e., y = 0) is sensitive to parts of the cross correlation corresponding to late times 
whereas the hyperbolae closest to the bisector (i.e., x = 0) are due to the zero-time-lag part of the cross correlation. The color scale has been saturated so as 
to enhance the visibility of the relatively weak hyperbolae. 



3.2 Non-uniform source distribution and the event kernel 



The presence of strong non-uniformities can render inaccurate results pertaining to the correspondence between the cross correlation mea- 
surement and Green's functions along the station pair. The integral in (ITU is over all space and provided the weight function cr(x) is also 
uniform, the expectation value of the cross correlation is well behaved, displaying features similar to classical tomographic arrivals, an in- 
stance of which is shown on the upper panels of Figured] In fact, for the case considered here, the expected cross corr elation can be shown 
to be a frequency modulation of Green's functions along the path (without wave attenuation; see Tromp et al. . 2010h . However, when the 
source distribution a becomes more non-uniform, the expectation value of the cross correlation shifts away from the elegant Green's function 
analog and adopts more complicated forms. A particularly stark example is when the sources lie along the bisector line perpendicular to the 
path between the station-pair: the cross correlation in such a case will be centered around zero time, since t he path difference from the source 
to the stations is zero. We also consider a situation that has been studied extensively in past literature (e.g. JPerode et alll2003l : lLarose et all 
2OO6: lFleurvetalll2O10h . namely that of a ring of sources surrounding a station pair. Indeed, we find in Figure |9] that the cross correlation 
owing to a uniform distribution of sources is almost identical to that in the ring of sources scenario. As opposed to a discrete number of 
sources placed at a certain radius around the station pair, we use a continuous annulus to represent the ring. The amplitude of the uniform 
distribution is substantially smaller in order that the cross correlations from these two situations have the same energy. 

In Figure [To) we make the case for the imaging of anisotropic source distributions. The "true" distribution is shown on the left panel. 
The starting "synthetic" model consists of a uniform distribution of sources whose amplitude is the same as that of the "true" distribution 
away from the local spot of increased amplitude on the south-east quadrant. We use the energy of the full envelope of the cross correlation, 
from zero to the classical surface- wave arrival time. We compute the misfit according to equation (l23l) . The event kernel associated with a 
particular station a is a sum of the point-to-point kernels between that station and all other stations, i.e., 



K« (x) = V In K aP (x) 



(30) 



where A ohs signifies the "true" energies. T he sum of all the ev ent kernels associated with a network of stations provides a way to update the 
model, and is J2 a K a (x). As discussed in lTromp et al. the cost of the inversion scales with the number of "master" pixels a and is 

independent of the number of "slaves" f3. However because we use a translationally invariant background model, the computation of kernels 
is cheap and so we may include as many master pixels as we desire. In this case, we stop at 20 stations, i.e., 20 master and 19 slave pixels. 

In Figure [Toj we show the sum of event kernels corresponding to this set of "data" and "synthetics". The kernels neatly focus onto the 
area where the source amplitude is locally large (by 500% in comparison to the value away from this spot). In order to image this localized 
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Figure 6. Source kernels as a function of interstation distance for a uniform source distribution. The measurement is the energy of the entire waveform. At 
small distances, the lateral size of the kernel is comparable to the interstation distance whereas at very large distances, the sensitivity is restricted to a small 
range of azimuths around the interstation path. 



spot better, we would require a greater coverage by the array, i.e., an array that surrounds the spot, as shown in FigureQj] Note that the farther 
away the anomalous source activity is from the array, thus diminishes the ability to discern their location. An example of such a situation 
is shown in Figure Q21 One may interpret it as the sources being far enough away that when the waves arrive at the stations, their curvature 
(oc A^ 1 , As being the distance from the source) is so small that they appear as plane waves, and information about the source location is 
thus lost. In order to perceive such small curvature, a network of stations placed far apart (oc As) becomes necessary. An additional reason 
for the spatially localized sensitivity is that the source kernel is of greatest amplitude along the interstation paths and in the vicinities of the 
stations. Conceptually, imaging of the noise source distribution is not very different from that of inverting for an earthquake source; both 
require appropriate choices for measurements and a good network of stations. Subsequently, by studying the inverted source distribution, 
we may arrive at t he conclusion that t he distribution is too far away to image. In this case the procedure is essentially no different from 
beamforming (e.g., Stehlv et all 2006h . However, if there were to be more information in the wavefield (by using different time windows 
and some intrinsic curvature properties of the waves), then this method will be able to utilize it to produce better quality images of the noise 
source distribution than beamforming. 

Rapid temporal variations in the source distribution may also be lost when the cross correlations are averaged over long times. However, 
this is independent of the technique used, i.e., beamforming or an adjoint method (used here). Therefore it is useful to apply the sorts of 
methods described here since it maximally utilizes wavefield information. 



4 SCATTERING 

In the framework of correlation tomography, scattering kernels can be substantially more complicated. They are also intrinsically different in 
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Figure 7. Source kernels as a function of frequency of the measurement. The source-time functions used in computing the two sets of kernels are plotted on 
the bottom two panels. The measurement is the energy of the entire waveform. When we use a more rapidly varying source-time function (right column), the 
kernel shows greater complexity and spatially sharper features. Much as in classical tomography, finer-scale images of the source distribution may be obtained 
by using higher-frequency measurements. 



flavor from classical banana-doughnut kernels, containing additional hyperbolic features which represent sensitivity to sources at disparate 
spatial locations. Much as in FiguresQ]and[2l we attempt here to graphically explain the physics of scattering kernels for noise measurements. 
Variations to the limit cross correlation ^ are given by 

(SC aP ) = (f(Xa) tyfo)) + (<fy*(Xa) 0(X/j)) + 0(6^), (31) 

in which, keeping with convention, we do not explicitly state the dependence on frequency uj. Hitherto, we have ignored scattering terms but 
in this section, we describe their mathematical structure. For a given wave operator C and the corresponding wavefield cj) satisfying 

Ccj) = 5, (32) 

the first Born approximation (e.g., Hudson! . 1977 : Wu & Akil 1985 ) describing the singly scattered wavefield 5cj) owing to perturbations to 
the operator, SC, is 

S(C(/)) = C54) + 5C4) = 0, (33) 
where we assume that the source distribution is known exactly. We have 

C5cj) = -SC(/), (34) 
which upon using Green's theorem (Eq. 0) for the wavefield, we obtain 

<fy(x) = - J dx"G(x, x") [8C<t>] , (35) 
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Figure 8. Expected cross correlation (upper panels) and the source distributions (bottom panels). Stations are marked by the diamond symbols. Pathological 
distributions (such as the lower right) of sources can cause large shifts in the expectation value of the cross correlation because of the non-uniform manner in 
which waves illuminate the stations. In this case, the only source being the spot, is very close to the bisector line perpendicular to the path between the stations 
(i.e., y = 0), implying that the path difference between the spot and the stations is nearly zero. The cross correlation is thus almost (but not quite) centered 
around zero time. 



where < 



i and which may be rewritten in terms of the source distribution S(x.') as 

6<f>(x) = - J dx"G(x, x") SC^f dx G{x\ x) S(x) 



(36) 



This equation states that a source creates a wave at x', which propagates to a point x" as described by Green's function along that path, is 
singly scattered according to 5C and subsequently acts as a source, eventually propagating to measurement point x. This is the framework in 
which classical tomographic scattering is studied (shown in Figure[l3]). Substituting this into equation (|3TL 



J dxG*(x Q ,x)5*(x) jy dx"G(xi3,x")6£^f dx! G(x", x) 5(x)^ j 



(37) 



(jy dx!'G*(x a ,x!')5C* (y dx / G*(x // ,x / )5 , *(x / )^)| J dxG{x^x) S(x 



The terms in the flower brackets denote the scattering contributions and the terms within the square brackets show the direct wave arrival 
from the source to the observation points. Because we truncate the Born approximation to one term, i.e., considering only contributions from 
single scattering processes, the variation of the cross correlation consists of a direct wave pro pagating from a source to one of the stations, 
correlated with a singly scattered wave that propagates to the other measurement point. Akin to Gizon & Birchl ( 2002 ). we diagrammatically 
show the formal interpretation of the measurement in Figure Q3] In contrast to single- scattering theory applied to classical tomographic 
wavefield measurements (shown in Figure [T3ll, in which two Green's functions appear, the higher-order correlation measurement (regardless 
of whether these are "noise" or earthquake sources) requires the evaluation of three Green's functions. 
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Figure 9. The celebrated "ring of sources surrounding a station pair" scenario (left panel) and uniform distribution of weaker amplitude (right panel). Cross 
correlations at the stations (symbols) due to these distributions are over plotted (bottom panel). They are practically indistinguishable, and is part of the reason 
why the ring configuration of sources has been studied so extensively. 



Upon invoking the assumption of spatially uncorrelated sources, i.e., (5(x) 5'*(x / )) = £(x — x') cr(x) V(cj), we obtain 
(8C af i) = - J dx."G(xp,x")6£ ^ dx / G(x // ,x)G*(x a ,x)a(x)^ (38) 

-J dx"G*(x a ,x")<5£* ^ dxG*(/,x)G(x^x)a(x)?j , 

which produces a scattering diagram similar to Figure [14] except with coinciding points x, x', as shown in Figure [15] These kernels are 
indeed more difficult to compute than in the classical tomography case, and evidently require the evaluation of three Green's functions. The 
physics of these kernels is also conceptually different from the classical case. 



5 THE SENSITIVITY OF CROSS-CORRELATION ENERGIES TO ATTENUATION 



The topic of imaging wave attenuatio n using cross-correlation energies is a topic of interest (e.g., Cupillard & Capdeville . 2010l : Weaver et al. 



2011 : Prieto et al. . 2011 : Tsail 2011 ). The challenge is to accurately interpret enhanced decrements in cross-correlation energies amid effects 



of geometrical spreading with distance and wave- speed heterogeneities. Evidently, the distribution of sources significantly influences the 
conclusion of any inverse problem, and the problem of the determination of wave attenuation is no different. This suggests that the strategy 
typically followed in earthquake tomography, which is to first invert for the source and subsequently for structure (perhaps iteratively), may be 
applied equally to noise measurements. The standard trade-off between source and structure affects the interpretation of noise measurements 
as well. 

In Figure [16] we show the variation in energy (defined as the energy of the cross-correlation branch (l24l . positive or negative) of the 
cross correlation as a function of distance between the station pair for three source distributions. The variation in energy is entirely due to 
geometrical spreading and source distribution anisotropics. Note that the background model has no attenuation in this case. The scatter in 
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Figure 10. Sum of event kernels. The left panel shows the "true" source distribution, with respect to a nominal (uniform) value. Stations are marked by 
symbols. It contains a local spot of relatively large amplitude (500% increase over the uniform value) on the south-east quadrant. Because of the relative 
proximity of the station array to the sources, the event kernel is able to roughly localize over the spot. Since there is no coda (and complex structure) in this 
simple test case, the window encompasses the entire waveform. Thus the energy of the entire cross correlation contributes to the construction of the image of 
the noise source distribution. 



cross correlation energies for the anisotropic case may possibly be reduced by choosing an azimuthally varying normalization dPrieto et al 
200^) . 



The ring of sources case is seen to be different from the uniform distribution. This is because the in an attenuating medium, sources 
from farther away contribute less to the cross correlation. Therefore, the cross correlation energies in the ring and uniform cases converge to 
different expectation values, whose difference increases with the extent of attenuation. 

We also characterize the significance of a finite quality factor on energies when the network is illuminated by the source distributions of 
Figure Q2] We use a damping rate of 0.01 Hz, or a quality factor of roughly 150. Wave attenuation is modeled via solutions of the damped 
simple harmonic oscillator, i.e., operator © with a damping term 

dU + Yd t <t>- c 2 V 2 = 0, (39) 
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Figure 11. Sum of event kernels. The left panel shows the "true" source distribution, with respect to a nominal (uniform) value. Stations are marked by 
symbols. We include a local spot of relatively large amplitude (200% increase over the uniform value) amid the station array. Because stations surround the 
anomaly, they are able to accurately locate the distribution. Since there is no coda (and complex structure) in this simple test case, the window encompasses 
the entire waveform. Thus the energy of the entire cross correlation contributes to the construction of the image of the noise source distribution. 
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Figure 12. Sum of event kernels. The left panel shows the "true" source distribution, with respect to a nominal (uniform) value. Stations are marked by 
symbols. We include a local spot of relatively large amplitude (500% increase over the uniform value) moved farther away from the station array than in 
Figure [TO] Evidently, the stations are too far away from the sources to accurately locate the distribution. Since there is no coda (and complex structure) in this 
simple test case, the window encompasses the entire waveform. Thus the energy of the entire cross correlation contributes to the construction of the image of 
the noise source distribution. 



where attenuation Y has units of Hertz. Green's function for this operator is then 

G(x,x') = ^ (1) (^|x-x'|), (40) 

where the factor p — + iT/cj. Amplitudes evidently change but in entirely different ways, depending on the source distribution, as 
displayed in Figure \\7\ We show the percentage change in cross correlation energies due to the introduction of a spatially constant wave 
attenuation of 0.01 Hz. Figure [17] demonstrates that wave energies are indeed sensitive to attenuation, but extracting this information is 
subject to accurate knowledge of the source distribution. 




Figure 13. Single scattering in the classical earthquake tomography case. A source at x ; excites a wave that propagates to x /; where it scatters, acts as a 
source, propagating finally to station a. It is substantially simpler than the cross correlation measurement, which is depicted in Figures [14] and \l5\ Only two 
Green's functions are required to model single scattering in this scenario. 
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Figure 14. Scattering as captured by cross correlations for a general source distribution (similar to Figure [T} with non-zero spatial covariance. The first 
contribution (first line of Eq. [37] ) consists of a correlation between a wave generated at point x, propagated to a and a wave generated at x ; , scattered 
according to perturbation 5C at point x /x , propagated to measurement point (3. The second contribution (second line of Eq. (37)) is essentially the same except 
with points a, (3 reversed. This is the reason why three Green's functions are needed to formally interpret the measurement in terms of single-scattering theory. 

6 CONCLUSIONS 

Cross correlations are intrinsically more complex than classically used wavefield displacements. There are fundamental and meaningful 
differences between these measurements, which have consequences for the eventual solution of inverse problems. Because seismology is a 
precision science, it is important to formally interpret these measurements and capture their essence as fully as possible. Using a simple 2-D 
example, we have endeavored to delve into the physics of the cross correlation measurement. A goal of this article was to demonstrate the 
utility and ease of studying distributions of sources and posing the problem in terms of the expectation value of the relevant measurable. We 



Figure 15. Scattering as captured by cross correlations for a source distribution with zero spatial covariance (similar to Figure . The first contribution (first 
line of Eq. (38)) consists of a correlation between a wave generated at point x propagated to points a and x", with the latter scattered according to perturbation 
SC at point x" and eventually propagated to measurement point f3. The second contribution (second line of Eq. (38)) is essentially the same except with points 
a, /3 reversed. This is the reason why three Green's functions are needed to formally interpret the measurement in terms of single-scattering theory. The only 
difference between this figure and Figure[l4]is that the contributions to the cross correlation are from the same source point (i.e., when x = x ; ). 
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Figure 16. Cross-correlation energy (or amplitude; Eq. (24]) as a function of distance between station pairs (the network shown in the lower panels). Stations, 
marked by diamonds, are illuminated by a ring of sources (left), uniform sources (middle) and by a highly anisotropic distribution (right). The scatter in 
energies is entirely due to geometric spreading and source distribution anisotropy (for the anisotropic case). Amplitudes of every branch of every cross 
correlation are plotted (independent of orientation) in the upper panel for the three cases. They are normalized in all three cases such that the greatest value is 
1. Determining the source distribution prior to interpretation is strongly tied to the accurate interpretation of these measurements. However, it may be noted 
that i n the anisotropic case, the normalization could be chosen to be azimuthally dependent, resulting in energy shifts better suited to fitting ( Pri eto et all 
l2009h . 



make a case for the imaging of source distributions using measurements of cross-correlation energies. The dependence of these energies on 
station-pair distance and on wave attenuation is also touched upon. The influence of the source distribution on the energy measurement is 
demonstrated; cross-correlation energies unquestionably contain information about wave attenuation (primarily within the network) but it is 
hard to interpret them accurately without knowledge of the sources. 
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APPENDIX A: FOURIER CONVENTION 

The following Fourier transform convention is utilized 

/oo 
dte^git) = g(co), (A.l) 
-OO 

/OO 
dte iuJt = 2tvS(uj), (A.2) 
-oo 

± / dkje- mt g(u) = g(t), (A3) 



/OO 
due~ iu}t = 2ir6(t), 
-oo 



(A.4) 



where g(t),g(uj) are a Fourier-transform pair. The equivalence between cross-correlations and convolutions in the Fourier and temporal 
domain are written so 

/OO 
dt' /(*') g(t + t') h(w) = /* (w) sH, (A.5) 
-oo 

/OO 
dt' f{t') g(t - t') = /(w) ffH- (A.6) 

-oo 

The following relationship also holds (for real functions f(t),g(t)) 

/OO -| /* OO -| /» OO 

* /(*) 9(t) = ^J du /» = — y do- /(«) (A.7) 



